- Tuesday, September 3, 2024
Nvidia CEO Jensen Huang is trying to build Nvidia into a one-stop shop for all of the key elements in a data center. The strategy is designed to make the company's offerings stickier for customers. Nvidia is also building a business that supplies AI-optimized Ethernet, which is expected to generate billions of dollars in revenue within a year. Competition in the space is growing, with companies like AMD bolstering their data-center offerings and chip suppliers like Intel offering services and systems to help customers build and operate AI tools.
- Thursday, July 4, 2024
Nvidia CEO Jensen Huang attributes the company's dominance of the AI chip market, where it maintains a share of over 80% despite rising competition, to a strategic investment made a decade ago. Huang argues that Nvidia's AI chips are cost-effective and performant, and highlights the firm's transformation into a data center-focused company and its expansion into new markets.
- Wednesday, September 18, 2024
Nvidia's dominance in AI chips has propelled it to immense market value, largely thanks to its GPU capabilities and CUDA software ecosystem. However, competitors like AMD, Intel, Cerebras, and SambaNova are developing innovative solutions to challenge Nvidia's supremacy in AI hardware. While Nvidia's lead remains secure for now, the landscape is dynamic, with multiple players striving to carve out their own niches in the AI market.
- Thursday, August 8, 2024
Nvidia is facing increased government scrutiny from the EU, UK, China, and the US Justice Department over its dominant market share in AI chips and its sales practices. The company is rapidly building its legal and policy teams to address antitrust concerns amid profitable growth, as it commands 90 percent of the GPU market essential for AI systems. Nvidia is also adapting to increased competition oversight, with recent attention turning to its planned acquisition of Run:ai and its impact on the AI supply chain.
- Friday, April 19, 2024
NVIDIA's dominance in the AI space continues to be secured not just by hardware, but by its CUDA software ecosystem and proprietary interconnects. Alternatives like AMD's ROCm struggle to match CUDA's ease of use and performance optimization, ensuring NVIDIA's GPUs remain the preferred choice for AI workloads. Investments in the CUDA ecosystem and community education solidify NVIDIA's stronghold in AI compute.
- Wednesday, September 4, 2024
The US Department of Justice has sent subpoenas to Nvidia and other companies seeking evidence that the chipmaker violated antitrust laws. Antitrust officials are concerned that Nvidia is making it harder to switch to other suppliers and penalizing buyers that don't exclusively use its artificial intelligence chips. Nvidia claims that its market dominance stems from the quality of its products. The company prioritizes customers who can make use of its products in ready-to-go data centers as soon as they're provided to prevent stockpiling and to speed up the broader adoption of AI.
- Friday, April 26, 2024
Nvidia is acquiring AI infrastructure optimization firm Run:ai for approximately $700 million to enhance its DGX Cloud AI platform, allowing customers improved management of their AI workloads. The acquisition will support complex AI deployments across multiple data center locations. Run:ai had previous VC investments and a broad customer base, including Fortune 500 companies.
- Thursday, June 6, 2024
Nvidia became the second most valuable company in the world on Wednesday afternoon as its market capitalization hit $3.01 trillion. It became a $1 trillion company in May 2023 and hit $2 trillion in February this year. The company reported $14 billion in profit in May. Its AI accelerators account for between 70% and 95% of the AI chip market. Nvidia plans to launch a new AI chip every year.
- Wednesday, September 11, 2024
AMD announced at IFA 2024 that it will unify its RDNA and CDNA architectures into a combined UDNA microarchitecture, aiming to better compete with Nvidia's CUDA ecosystem. This strategic move seeks to streamline development and bolster AMD's position in AI and HPC markets. The transition to UDNA is a pivotal step, with full-scale implementation expected beyond the upcoming RDNA 4 generation.
- Monday, August 12, 2024
Nvidia's upcoming GB200 server racks will be mainly cooled with liquid circulated in tubes. The company is also working on other cooling technologies, including one that involves dunking entire computers in a non-conductive liquid that absorbs and dissipates heat. Cooling accounts for a significant amount of power consumption in data centers. Liquid-cooled data centers would be able to pack much more computing power in the same space.
- Wednesday, July 17, 2024
Vultr offers a full NVIDIA GPU stack with global access to the latest technology. With 32 cloud data center locations across 6 continents, their cloud infrastructure ensures global reach, enabling enterprises to power AI and ML at the edge efficiently. The state-of-the-art lineup of NVIDIA GPUs for AI/ML, AR/VR, high-performance computing, VDI/CAD, and more includes: NVIDIA GH200 Grace Hopper™ Superchip, NVIDIA H100 & H200 Tensor Core GPUs, NVIDIA A100 Tensor Core GPU, NVIDIA L40S GPU, NVIDIA A40 GPU, NVIDIA A16 GPU. Learn more about accelerating your organization's AI initiatives with affordable access to GPUs and begin exploring Vultr with a $250 credit.
- Tuesday, August 27, 2024
Many of Nvidia's employees are millionaires because of the company's growth. Despite this, the company still has a 'pressure cooker' culture with long working hours, yelling and fighting at meetings, and company politics. Some employees work every day, including weekends, late into the night. Employees who work less than the norm are called out at company-wide meetings. The company maintains a low turnover rate, likely due to the way it gives its employees access to stock grants and its 'flat' hierarchy, which could make the company an appealing choice.
- Tuesday, August 20, 2024
AMD has agreed to buy ZT Systems, an artificial intelligence infrastructure group, for $4.9 billion in cash and stock. The acquisition will help AMD accelerate the adoption of its AI data center chips, which compete with Nvidia's popular GPUs. The transaction is subject to regulatory approval and is expected to close in the first half of 2025.
- Wednesday, July 10, 2024
VC firm Andreessen Horowitz has secured thousands of AI chips, including Nvidia H100 GPUs, to dole out to its AI portfolio companies in exchange for equity.
- Monday, July 22, 2024
Nvidia is developing a new AI chip, the B20, tailored to comply with U.S. export controls for the Chinese market, leveraging its partnership with distributor Inspur. Its advanced H20 chip has reportedly seen rapid sales growth in China, with projected sales of over 1 million units worth $12 billion this year. U.S. pressure on semiconductor exports continues, with possible further restrictions and control measures on AI model development.
- Friday, September 27, 2024
Nvidia is addressing a significant challenge in telecommunications: the strain that artificial intelligence (AI) places on wireless networks. The company believes that AI can also provide solutions to these issues through its new AI-RAN platform, which aims to enhance the efficiency and performance of mobile networks. Collaborating with partners such as T-Mobile, Ericsson, and Nokia, Nvidia is set to test this innovative approach, with T-Mobile being the first to implement AI-RAN. The AI-RAN platform is designed to utilize vast amounts of data to create algorithms that optimize network adjustments and predict real-time capacity needs. This integration of AI into the radio access network is expected to make mobile networks smarter and faster, allowing telecommunications companies to run third-party AI applications at the network's edge. T-Mobile's CEO, Mike Sievert, highlighted the transformative potential of AI-RAN, while acknowledging the challenges involved in its implementation. As AI applications, particularly those related to augmented reality and AI-powered assistants, continue to grow, there is a pressing need to manage the increasing mobile data traffic that may exceed the capabilities of current 5G networks. Traditional networks were primarily designed for voice and basic data services, but the modern landscape demands more advanced solutions to support technologies like autonomous vehicles and smart factories. Nvidia's strategy involves positioning AI-RAN as a foundational element for future advancements, including the anticipated rollout of 6G technology. The AI-RAN Alliance, which includes Nvidia, T-Mobile, Nokia, and Ericsson, is actively working to harness the potential of AI in network operations. The alliance aims to tackle the challenges posed by the massive volume of data generated by AI-driven applications. 
Experts emphasize that network optimization will be crucial, as machine learning algorithms will need to dynamically adjust configurations to enhance performance and manage resources effectively. This collaborative effort seeks to ensure that telecommunications infrastructure can keep pace with the evolving demands of AI and emerging technologies.
- Friday, September 27, 2024
The Vultr Cloud Alliance has formed a significant partnership with AMD to enhance high-performance artificial intelligence (AI) and high-performance computing (HPC) capabilities. This collaboration integrates AMD's advanced Instinct™ MI300X GPU accelerators with Vultr's expansive global cloud infrastructure, creating a powerful solution tailored for enterprises across various industries. AMD is recognized as a leader in high-performance computing, providing the MI300X GPUs and the ROCm™ open software ecosystem. The MI300X GPU is designed for high processing power and substantial memory capacity, making it particularly effective for complex AI models and demanding HPC workloads. The ROCm™ software ecosystem supports major AI frameworks like PyTorch and TensorFlow, facilitating flexibility and rapid development for users. The integration of AMD's technology with Vultr's infrastructure allows businesses to accelerate performance, streamline operations, and reduce costs. This partnership emphasizes a composable and flexible approach to cloud solutions, enabling enterprises of all sizes to access high-performance computing and AI capabilities without the constraints of vendor lock-in. This accessibility is crucial for democratizing AI and inference, allowing even smaller enterprises to utilize advanced technologies that were previously unattainable. The collaboration also addresses the needs of various industries, including healthcare, financial services, manufacturing, energy, media, retail, and telecommunications. By combining AMD's powerful GPUs and ROCm™ software with Vultr's scalable cloud services, businesses can tackle common challenges such as computational power, data management, and regulatory compliance. Customized solutions are provided to enhance performance and efficiency, tailored to the specific requirements of different sectors. 
With AMD's involvement in the Vultr Cloud Alliance Program, enterprises can leverage a unique combination of high-performance GPUs, open software, and flexible cloud infrastructure. This partnership aims to drive innovation, reduce costs, and make advanced AI and HPC solutions accessible to a broader range of businesses. Organizations are encouraged to explore the potential of this collaboration and consider how it can shape the future of cloud computing. For those interested in getting started, further information is available on the Vultr website, or potential users can reach out to the sales team for assistance.
- Thursday, June 20, 2024
Nvidia is now the most valuable public company in the world. Its market cap surpassed Microsoft's $3.32 trillion on Tuesday, reaching a high of $3.34 trillion. Nvidia's shares are up more than 170% so far this year. Its market cap hit $3 trillion for the first time earlier this month. Nvidia's rise has been so rapid that the company has yet to be added to the Dow Jones Industrial Average, a stock benchmark of 30 prominent US companies.
- Monday, April 15, 2024
The next big narrative in crypto might be centered around GPU and cloud computing infrastructure, driven by the growing demand for artificial intelligence training and the asymmetry between rapidly advancing software and the slower pace of hardware development. Sam Altman's plan to raise trillions to accelerate chip manufacturing, the potential reunification of China and Taiwan, and the upcoming io.net token generation in April could catalyze interest in this narrative. Numerous projects in this sector could capitalize on this “GPU is the new oil” sentiment.
- Tuesday, September 3, 2024
Nvidia's new Blackwell chip demonstrated top per-GPU performance in MLPerf's LLM Q&A benchmark, showcasing significant advancements with its 4-bit floating-point precision. However, competitors like Untether AI and AMD also showed promising results, particularly in energy efficiency. Untether AI's speedAI240 chip, for instance, excelled in the edge-closed category, highlighting diverse strengths across new AI inference hardware.
- Friday, March 15, 2024
NVIDIA co-founder Curtis Priem has donated $275 million to Rensselaer Polytechnic Institute (RPI), impacting its technological advancements and allowing it to house an IBM Quantum System One computer. He gave away his NVIDIA shares after the IPO, valuing meaningful contributions over wealth retention. Priem's philanthropy has been pivotal in enhancing RPI's academic and research infrastructure.
- Monday, March 11, 2024
Nvidia is discontinuing its Turing-based GTX GPUs, moving towards exclusively branding its gaming graphics cards under the "RTX" lineup. The transition signifies a shift away from the GTX series in favor of cards that support advanced features like ray tracing. The GT series may persist, but the GTX line is on its last legs as stocks deplete.
- Monday, April 22, 2024
NVIDIA will power Japan's new quantum supercomputer, ABCI-Q, alongside Fujitsu, integrating 2,000 NVIDIA H100 AI GPUs and the CUDA-Q platform for quantum-classical computing applications. The project aims to advance Japan's capabilities in quantum computing and AI. This collaboration is part of a broader technological partnership between NVIDIA and Japan.
- Monday, September 2, 2024
Nvidia's Blackwell chips are about twice as big as their predecessors, housing 2.6 times the number of transistors. Instead of one big piece of silicon, Blackwell consists of two advanced processors and numerous memory components joined in a single, delicate mesh of silicon, metal, and plastic. The manufacturing of each chip has to be close to perfect, presenting engineering challenges that have a sizable impact on the bottom line, with each defect rendering a $40,000 chip useless. This article looks at some of the challenges Nvidia had to overcome to produce the chip.
- Tuesday, June 4, 2024
AMD unveiled its latest AI processors, including the MI325X accelerator due in Q4 2024, at the Computex trade show. It also detailed plans to compete with Nvidia by releasing new AI chips annually. The MI350 series, expected in 2025, promises a 35-fold performance increase in inference compared to the MI300 series. The MI400 series is set for a 2026 release.
- Friday, April 5, 2024
GPU provider Lambda has secured a special $500m debt financing deal to expand its GPU cloud offering, in addition to the $230m Series C it raised earlier this year.
- Monday, June 3, 2024
Nvidia has unveiled a new generation of artificial intelligence chip architecture called Rubin. The company only just announced its upcoming Blackwell model in March; those chips are still in production and are expected to ship to customers later in 2024. Nvidia has pledged to release new AI chip models on a one-year rhythm. The less-than-three-month turnaround from Blackwell to Rubin underscores the competitive frenzy in the AI chip market.
- Friday, August 30, 2024
Apple and Nvidia are in talks to invest in OpenAI as part of a fundraising round that would value OpenAI at above $100 billion. It is an unusual move for Apple, as the company doesn't usually invest in startups. Nvidia has stepped up its investment activity in the past two years, putting its money into AI-related companies. OpenAI is one of the largest users of Nvidia's AI chips.